Optimal query/update tradeoffs in versioned dictionaries
نویسندگان
چکیده
External-memory dictionaries are a fundamental data structure in file systems and databases. Versioned (or fullypersistent) dictionaries have an associated version tree where queries can be performed at any version, updates can be performed on leaf versions, and any version can be ‘cloned’ by adding a child. Various query/update tradeoffs are known for unversioned dictionaries, many of them with matching upper and lower bounds. No fully-versioned external-memory dictionaries are known with optimal space/query/update tradeoffs. In particular, no versioned constructions are known that offer updates in o(1) I/Os using O(N) space. We present the first cache-oblivious and cache-aware constructions that achieve a wide range of optimal points on this tradeoff. General Terms Cache-oblivious algorithms, External-memory algorithms, Versioned data structures
منابع مشابه
Stratified B-trees and versioning dictionaries
A classic versioned data structure in storage and computer science is the copy-on-write (CoW) B-tree – it underlies many of today’s file systems and databases, including WAFL, ZFS, Btrfs and more. Unfortunately, it doesn’t inherit the B-tree’s optimality properties; it has poor space utilization, cannot offer fast updates, and relies on random IO to scale. Yet, nothing better has been developed...
متن کاملStratified B-trees and Versioned Dictionaries
External-memory versioned dictionaries are fundamental to file systems, databases and many other algorithms. The ubiquitous data structure is the copy-onwrite (CoW) B-tree. Unfortunately, it doesn’t inherit the B-tree’s optimality properties; it has poor space utilization, cannot offer fast updates, and relies on random IO to scale. We describe the ‘stratified B-tree’, which is the first versio...
متن کاملDynamic Rank/Select Dictionaries with Applications to XML Indexing
We consider a central problem in text indexing: Given a text T over an alphabet Σ, construct a compressed data structure answering the queries char (i), rank s(i), and selects(i) for a symbol s ∈ Σ. Many data structures consider these queries for static text T [GGV03, FM01, SG06, GMR06]. We consider the dynamic version of the problem, where we are allowed to insert and delete symbols at arbitra...
متن کاملTradeoffs for nearest neighbors on the sphere
We consider tradeoffs between the query and update complexities for the (approximate) nearest neighbor problem on the sphere, extending the spherical filters recently introduced by [Becker–Ducas–Gama– Laarhoven, SODA’16] to sparse regimes and generalizing the scheme and analysis to account for different tradeoffs. In a nutshell, for the sparse regime the tradeoff between the query complexity nq...
متن کاملTVCL - Temporal Versioned Constraint Language
This paper proposes the Temporal Versioned Constraint Language (TVCL), which provides a declarative and non ambiguous way to user specification of constraints for temporal databases with versions support, based on the Temporal Versions Model (TVM). This language gives support to time and versions characteristics of data and of the constraints themselves, allowing the definition of constraints v...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1103.2566 شماره
صفحات -
تاریخ انتشار 2011